Jieyu Zhao

Mitigating Bias in Locally Constrained Decoding via Tractable Proposals

Jun 01, 2026
Via arXiv

Skill Reuse as Compression in Agentic RL

May 29, 2026
Via arXiv

Converted, Not Equivalent: Benchmarking Codebase Conversion via Observational Equivalence

May 27, 2026
Via arXiv

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Apr 12, 2026
Via arXiv

Video-Based Reward Modeling for Computer-Use Agents

Mar 10, 2026
Via arXiv

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

Feb 28, 2026
Via arXiv

Experiential Reinforcement Learning

Feb 15, 2026
Via arXiv

CoAct-1: Computer-using Agents with Coding as Actions

Aug 05, 2025
Via arXiv

Can LLMs Express Personality Across Cultures? Introducing CulturalPersonas for Evaluating Trait Alignment

Jun 06, 2025
Via arXiv

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

May 29, 2025
Via arXiv